Overview

Dataset statistics

Number of variables: 9
Number of observations: 500
Missing cells: 40
Missing cells (%): 0.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 35.3 KiB
Average record size in memory: 72.3 B
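
The two derived figures in this table can be checked by hand. The sketch below assumes that the missing-cell percentage is taken over all cells (variables × observations) and that the average record size is the total size divided by the number of observations; both assumptions reproduce the reported values, but the report itself does not state the formulas.

```python
# Figures copied from the dataset statistics.
n_variables = 9
n_observations = 500
missing_cells = 40
total_size_bytes = 35.3 * 1024  # 35.3 KiB expressed in bytes

# Assumed formulas (not stated in the report).
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells
avg_record_bytes = total_size_bytes / n_observations

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"Average record size in memory: {avg_record_bytes:.1f} B")
```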

Variable types

NUM: 8
BOOL: 1

Warnings

GRE Score has 15 (3.0%) missing values (Missing)
TOEFL Score has 10 (2.0%) missing values (Missing)
University Rating has 15 (3.0%) missing values (Missing)
Serial No. has unique values (Unique)

Reproduction

Analysis started: 2022-10-14 10:38:33.919061
Analysis finished: 2022-10-14 10:38:44.815211
Duration: 10.9 seconds
Software version: pandas-profiling v2.9.0
Download configuration: config.yaml

Variables

Serial No.
Real number (ℝ≥0)

UNIQUE

Distinct: 500
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 250.5
Minimum: 1
Maximum: 500
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 25.95
Q1: 125.75
median: 250.5
Q3: 375.25
95-th percentile: 475.05
Maximum: 500
Range: 499
Interquartile range (IQR): 249.5
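
The quantile figures for Serial No. are consistent with linear interpolation between neighbouring sorted values. The following is a minimal sketch under that assumption; `percentile` is a hypothetical helper written for illustration, not a function from the profiling library.

```python
def percentile(values, q):
    """Return the q-th percentile (0 to 100) using linear interpolation."""
    ordered = sorted(values)
    pos = (len(ordered) - 1) * q / 100  # fractional index into the sorted data
    lower = int(pos)
    upper = min(lower + 1, len(ordered) - 1)
    frac = pos - lower
    return ordered[lower] + frac * (ordered[upper] - ordered[lower])

# Serial No. runs from 1 to 500 with no gaps.
serial_no = list(range(1, 501))

p5 = percentile(serial_no, 5)
q1 = percentile(serial_no, 25)
q3 = percentile(serial_no, 75)
p95 = percentile(serial_no, 95)
iqr = q3 - q1
value_range = max(serial_no) - min(serial_no)

print(round(p5, 2), q1, q3, round(p95, 2), iqr, value_range)
```

Under this interpolation rule the computed percentiles, IQR and range agree with the values listed for Serial No.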

Descriptive statistics

Standard deviation: 144.4818328
Coefficient of variation (CV): 0.5767737835
Kurtosis: -1.2
Mean: 250.5
Median Absolute Deviation (MAD): 125
Skewness: 0
Sum: 125250
Variance: 20875
Monotonicity: Strictly increasing
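
Several of the descriptive statistics for Serial No. can be recomputed with the Python standard library. This sketch assumes the sample (n − 1) variance, a CV defined as standard deviation divided by mean, and a MAD defined as the median of absolute deviations from the median; these definitions reproduce the reported figures, although the report does not spell them out.

```python
import statistics

# Serial No. runs from 1 to 500 with no gaps.
serial_no = list(range(1, 501))

total = sum(serial_no)
mean = statistics.mean(serial_no)
variance = statistics.variance(serial_no)  # sample variance (divides by n - 1)
std = statistics.stdev(serial_no)          # square root of the sample variance
cv = std / mean                            # coefficient of variation
median = statistics.median(serial_no)
mad = statistics.median(abs(x - median) for x in serial_no)

# A series is strictly increasing when every value exceeds its predecessor.
strictly_increasing = all(a < b for a, b in zip(serial_no, serial_no[1:]))

print(total, mean, variance, round(std, 7), round(cv, 10), mad, strictly_increasing)
```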
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
500                 1      0.2%
171                 1      0.2%
158                 1      0.2%
159                 1      0.2%
160                 1      0.2%
161                 1      0.2%
162                 1      0.2%
163                 1      0.2%
164                 1      0.2%
165                 1      0.2%
Other values (490)  490    98.0%

Value               Count  Frequency (%)
1                   1      0.2%
2                   1      0.2%
3                   1      0.2%
4                   1      0.2%
5                   1      0.2%

Value               Count  Frequency (%)
500                 1      0.2%
499                 1      0.2%
498                 1      0.2%
497                 1      0.2%
496                 1      0.2%

GRE Score
Real number (ℝ≥0)

MISSING

Distinct: 49
Distinct (%): 10.1%
Missing: 15
Missing (%): 3.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 316.5587629
Minimum: 290
Maximum: 340
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.9 KiB

Quantile statistics

Minimum: 290
5-th percentile: 298
Q1: 308
median: 317
Q3: 325
95-th percentile: 335
Maximum: 340
Range: 50
Interquartile range (IQR): 17

Descriptive statistics

Standard deviation: 11.2747043
Coefficient of variation (CV): 0.03561646565
Kurtosis: -0.6844666911
Mean: 316.5587629
Median Absolute Deviation (MAD): 8
Skewness: -0.05168658259
Sum: 153531
Variance: 127.1189571
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=49)
Value               Count  Frequency (%)
324                 22     4.4%
312                 22     4.4%
321                 17     3.4%
316                 17     3.4%
327                 17     3.4%
322                 17     3.4%
314                 16     3.2%
320                 16     3.2%
311                 16     3.2%
317                 15     3.0%
Other values (39)   310    62.0%

Value               Count  Frequency (%)
290                 2      0.4%
293                 1      0.2%
294                 2      0.4%
295                 5      1.0%
296                 5      1.0%

Value               Count  Frequency (%)
340                 9      1.8%
339                 3      0.6%
338                 4      0.8%
337                 2      0.4%
336                 5      1.0%

TOEFL Score
Real number (ℝ≥0)

MISSING

Distinct: 29
Distinct (%): 5.9%
Missing: 10
Missing (%): 2.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 107.1877551
Minimum: 92
Maximum: 120
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.9 KiB

Quantile statistics

Minimum: 92
5-th percentile: 98
Q1: 103
median: 107
Q3: 112
95-th percentile: 118
Maximum: 120
Range: 28
Interquartile range (IQR): 9

Descriptive statistics

Standard deviation: 6.112899387
Coefficient of variation (CV): 0.0570298294
Kurtosis: -0.6645653663
Mean: 107.1877551
Median Absolute Deviation (MAD): 5
Skewness: 0.1020677321
Sum: 52522
Variance: 37.36753892
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=29)
Value               Count  Frequency (%)
110                 42     8.4%
105                 37     7.4%
104                 29     5.8%
107                 28     5.6%
112                 27     5.4%
106                 26     5.2%
103                 25     5.0%
102                 24     4.8%
100                 24     4.8%
99                  22     4.4%
Other values (19)   206    41.2%

Value               Count  Frequency (%)
92                  1      0.2%
93                  2      0.4%
94                  2      0.4%
95                  3      0.6%
96                  6      1.2%

Value               Count  Frequency (%)
120                 9      1.8%
119                 10     2.0%
118                 10     2.0%
117                 8      1.6%
116                 16     3.2%

University Rating
Real number (ℝ≥0)

MISSING

Distinct: 5
Distinct (%): 1.0%
Missing: 15
Missing (%): 3.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.121649485
Minimum: 1
Maximum: 5
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 2
median: 3
Q3: 4
95-th percentile: 5
Maximum: 5
Range: 4
Interquartile range (IQR): 2

Descriptive statistics

Standard deviation: 1.146160209
Coefficient of variation (CV): 0.3671649281
Kurtosis: -0.8340454487
Mean: 3.121649485
Median Absolute Deviation (MAD): 1
Skewness: 0.0910567357
Sum: 1514
Variance: 1.313683224
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=5)
Value               Count  Frequency (%)
3                   154    30.8%
2                   124    24.8%
4                   103    20.6%
5                   72     14.4%
1                   32     6.4%
(Missing)           15     3.0%

Value               Count  Frequency (%)
1                   32     6.4%
2                   124    24.8%
3                   154    30.8%
4                   103    20.6%
5                   72     14.4%

Value               Count  Frequency (%)
5                   72     14.4%
4                   103    20.6%
3                   154    30.8%
2                   124    24.8%
1                   32     6.4%

SOP
Real number (ℝ≥0)

Distinct: 9
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.374
Minimum: 1
Maximum: 5
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1.5
Q1: 2.5
median: 3.5
Q3: 4
95-th percentile: 5
Maximum: 5
Range: 4
Interquartile range (IQR): 1.5

Descriptive statistics

Standard deviation: 0.9910036208
Coefficient of variation (CV): 0.2937177299
Kurtosis: -0.7057169536
Mean: 3.374
Median Absolute Deviation (MAD): 0.5
Skewness: -0.2289723963
Sum: 1687
Variance: 0.9820881764
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=9)
Value               Count  Frequency (%)
4                   89     17.8%
3.5                 88     17.6%
3                   80     16.0%
2.5                 64     12.8%
4.5                 63     12.6%
2                   43     8.6%
5                   42     8.4%
1.5                 25     5.0%
1                   6      1.2%

Value               Count  Frequency (%)
1                   6      1.2%
1.5                 25     5.0%
2                   43     8.6%
2.5                 64     12.8%
3                   80     16.0%

Value               Count  Frequency (%)
5                   42     8.4%
4.5                 63     12.6%
4                   89     17.8%
3.5                 88     17.6%
3                   80     16.0%

LOR
Real number (ℝ≥0)

Distinct: 9
Distinct (%): 1.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.484
Minimum: 1
Maximum: 5
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.9 KiB

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 3
median: 3.5
Q3: 4
95-th percentile: 5
Maximum: 5
Range: 4
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 0.9254495739
Coefficient of variation (CV): 0.2656284655
Kurtosis: -0.7457485106
Mean: 3.484
Median Absolute Deviation (MAD): 0.5
Skewness: -0.1452903146
Sum: 1742
Variance: 0.8564569138
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=9)
Value               Count  Frequency (%)
3                   99     19.8%
4                   94     18.8%
3.5                 86     17.2%
4.5                 63     12.6%
5                   50     10.0%
2.5                 50     10.0%
2                   46     9.2%
1.5                 11     2.2%
1                   1      0.2%

Value               Count  Frequency (%)
1                   1      0.2%
1.5                 11     2.2%
2                   46     9.2%
2.5                 50     10.0%
3                   99     19.8%

Value               Count  Frequency (%)
5                   50     10.0%
4.5                 63     12.6%
4                   94     18.8%
3.5                 86     17.2%
3                   99     19.8%

CGPA
Real number (ℝ≥0)

Distinct: 184
Distinct (%): 36.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 8.57644
Minimum: 6.8
Maximum: 9.92
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.9 KiB

Quantile statistics

Minimum: 6.8
5-th percentile: 7.638
Q1: 8.1275
median: 8.56
Q3: 9.04
95-th percentile: 9.6
Maximum: 9.92
Range: 3.12
Interquartile range (IQR): 0.9125

Descriptive statistics

Standard deviation: 0.6048128003
Coefficient of variation (CV): 0.07052026253
Kurtosis: -0.5612783981
Mean: 8.57644
Median Absolute Deviation (MAD): 0.46
Skewness: -0.02661251732
Sum: 4288.22
Variance: 0.3657985234
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
8                   9      1.8%
8.76                9      1.8%
8.54                7      1.4%
8.45                7      1.4%
8.56                7      1.4%
8.12                7      1.4%
7.88                6      1.2%
8.64                6      1.2%
8.66                6      1.2%
9.11                6      1.2%
Other values (174)  430    86.0%

Value               Count  Frequency (%)
6.8                 1      0.2%
7.2                 1      0.2%
7.21                1      0.2%
7.23                1      0.2%
7.25                1      0.2%

Value               Count  Frequency (%)
9.92                1      0.2%
9.91                1      0.2%
9.87                2      0.4%
9.86                1      0.2%
9.82                1      0.2%

Research
Boolean

Distinct: 2
Distinct (%): 0.4%
Missing: 0
Missing (%): 0.0%
Memory size: 3.9 KiB

Value               Count  Frequency (%)
1                   280    56.0%
0                   220    44.0%

Chance of Admit
Real number (ℝ≥0)

Distinct: 61
Distinct (%): 12.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.72174
Minimum: 0.34
Maximum: 0.97
Zeros: 0
Zeros (%): 0.0%
Memory size: 3.9 KiB

Quantile statistics

Minimum: 0.34
5-th percentile: 0.47
Q1: 0.63
median: 0.72
Q3: 0.82
95-th percentile: 0.94
Maximum: 0.97
Range: 0.63
Interquartile range (IQR): 0.19

Descriptive statistics

Standard deviation: 0.141140404
Coefficient of variation (CV): 0.1955557458
Kurtosis: -0.4546817998
Mean: 0.72174
Median Absolute Deviation (MAD): 0.1
Skewness: -0.28996621
Sum: 360.87
Variance: 0.01992061363
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
0.71                23     4.6%
0.64                19     3.8%
0.73                18     3.6%
0.72                16     3.2%
0.79                16     3.2%
0.78                15     3.0%
0.76                14     2.8%
0.8                 13     2.6%
0.7                 13     2.6%
0.94                13     2.6%
Other values (51)   340    68.0%

Value               Count  Frequency (%)
0.34                2      0.4%
0.36                2      0.4%
0.37                1      0.2%
0.38                2      0.4%
0.39                1      0.2%

Value               Count  Frequency (%)
0.97                4      0.8%
0.96                8      1.6%
0.95                5      1.0%
0.94                13     2.6%
0.93                12     2.4%

Interactions

[Figures: 64 pairwise interaction plots (8 × 8 numeric variables)]

Correlations


Pearson's r

Pearson's correlation coefficient (r) measures the linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for a linear relationship the slope of the line (its angle to the x-axis) does not affect r as long as the sign of the slope is unchanged.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
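
As an illustration of this calculation, the sketch below computes r directly from the definition. `pearson_r` is a hypothetical helper written for this example, and the data are the CGPA and Chance of Admit columns from the ten "First rows" of the sample, so the result will not match the report's correlation matrix for the full 500 rows.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / (n - 1))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / (n - 1))
    return cov / (std_x * std_y)

# CGPA and Chance of Admit from the ten "First rows" of the sample.
cgpa = [9.65, 8.87, 8.00, 8.67, 8.21, 9.34, 8.20, 7.90, 8.00, 8.60]
chance = [0.92, 0.76, 0.72, 0.80, 0.65, 0.90, 0.75, 0.68, 0.50, 0.45]

r = pearson_r(cgpa, chance)

# Shifting and rescaling one variable (positive scale) leaves r unchanged.
r_rescaled = pearson_r([10 * c + 3 for c in cgpa], chance)

print(round(r, 3), round(r_rescaled, 3))
```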

Spearman's ρ

Spearman's rank correlation coefficient (ρ) measures the monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
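
A minimal sketch of this rank-based calculation follows. It assumes tied values receive the average of the ranks they span, which is a common convention but is not stated in the report; `average_ranks`, `pearson_r` and `spearman_rho` are hypothetical helpers, and the data are again the CGPA and Chance of Admit columns from the ten "First rows" of the sample.

```python
import math

def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the current group of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average 1-based rank of the tied group
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / (n - 1))
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / (n - 1))
    return cov / (std_x * std_y)

def spearman_rho(x, y):
    """Pearson's r applied to the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

# CGPA (note the tie at 8.00) and Chance of Admit from the "First rows" sample.
cgpa = [9.65, 8.87, 8.00, 8.67, 8.21, 9.34, 8.20, 7.90, 8.00, 8.60]
chance = [0.92, 0.76, 0.72, 0.80, 0.65, 0.90, 0.75, 0.68, 0.50, 0.45]

rho = spearman_rho(cgpa, chance)
print(round(rho, 3))
```

Because only the ranks enter the calculation, applying any strictly increasing transformation to either variable leaves ρ unchanged.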

Kendall's τ

Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one counts the concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
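
The pair-counting formula can be sketched as follows. This is the plain variant (often called tau-a) exactly as described in the text, with tied pairs counted as neither concordant nor discordant; libraries such as SciPy default to a tie-adjusted variant (tau-b), so their output may differ when ties are present. `kendall_tau_a` is a hypothetical helper name.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # The sign of the product tells whether x and y move in the same direction.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # direction == 0 means a tie in x or y: counted in neither group.
    return (concordant - discordant) / len(pairs)

# CGPA and Chance of Admit from the ten "First rows" of the sample.
cgpa = [9.65, 8.87, 8.00, 8.67, 8.21, 9.34, 8.20, 7.90, 8.00, 8.60]
chance = [0.92, 0.76, 0.72, 0.80, 0.65, 0.90, 0.75, 0.68, 0.50, 0.45]

tau = kendall_tau_a(cgpa, chance)
print(round(tau, 3))
```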

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Missing values

[Figures: 4 missing-value plots]

Sample

First rows

Index  Serial No.  GRE Score  TOEFL Score  University Rating  SOP  LOR  CGPA  Research  Chance of Admit
0      1           337.0      118.0        4.0                4.5  4.5  9.65  1         0.92
1      2           324.0      107.0        4.0                4.0  4.5  8.87  1         0.76
2      3           NaN        104.0        3.0                3.0  3.5  8.00  1         0.72
3      4           322.0      110.0        3.0                3.5  2.5  8.67  1         0.80
4      5           314.0      103.0        2.0                2.0  3.0  8.21  0         0.65
5      6           330.0      115.0        5.0                4.5  3.0  9.34  1         0.90
6      7           321.0      109.0        NaN                3.0  4.0  8.20  1         0.75
7      8           308.0      101.0        2.0                3.0  4.0  7.90  0         0.68
8      9           302.0      102.0        1.0                2.0  1.5  8.00  0         0.50
9      10          323.0      108.0        3.0                3.5  3.0  8.60  0         0.45

Last rows

Index  Serial No.  GRE Score  TOEFL Score  University Rating  SOP  LOR  CGPA  Research  Chance of Admit
490    491         307.0      105.0        2.0                2.5  4.5  8.12  1         0.67
491    492         297.0      99.0         4.0                3.0  3.5  7.81  0         0.54
492    493         298.0      101.0        4.0                2.5  4.5  7.69  1         0.53
493    494         300.0      95.0         2.0                3.0  1.5  8.22  1         0.62
494    495         301.0      99.0         3.0                2.5  2.0  8.45  1         0.68
495    496         332.0      108.0        5.0                4.5  4.0  9.02  1         0.87
496    497         337.0      117.0        5.0                5.0  5.0  9.87  1         0.96
497    498         330.0      120.0        5.0                4.5  5.0  9.56  1         0.93
498    499         312.0      103.0        4.0                4.0  5.0  8.43  0         0.73
499    500         327.0      113.0        4.0                4.5  4.5  9.04  0         0.84